SEQUENCE LISTING 



10 



<110> Lyamichev, Victor 

<12 0> CHARGE TAGS AND SEPARATION OF NUCLEIC ACID MOLECULES 

<130> FORS 4912 

<160> 85 

<170> Patentln version 3.0 

<210> 1 

<211> 23 

<212> DNA 

15 <213> synthetic 

<220> 

<221> misc_f eature 

<222> (1) . . (2) 

20 

<223> misc. feature 

<400> 1 

2 5-^ tntcttttca ccagcgagac ggg 

-::J <210> 2 

S.s <211> 22 

3Q/* <212> DNA 

^ <213> synthetic 

<400> 2 

X attgggcgcc agggtggttt tt 

35^* 



7, 



<210> 3 

<211> 30 

<212> DNA 

46^ <213> synthetic 

© <400> 3 



45 



cagggtgaag ggaagaagaa agcgaaaggt 



<210> 4 

<211> 30 

<212> DNA 

<213> synthetic 

50 <400> 4 

cagggggaag ggaagaagaa agcgaaaggt 



55 <210> 5 

<211> 28 

<212> DNA 

60 

<213> synthetic 



94 



<400> 5 

cacgaattcc gaggcgatgc ttccgctc 

<210> 6 
<211> 30 
<212> DNA 
<213> synthetic 
<400> 6 

tcgacgtcga ctaacccttg gcggaaagcc 

<210> 7 
<211> 23 
<212> DNA 
<213> synthetic 
<400> 7 

gcatcgcctc ggaattcatg gtc 

<210> 8 
<211> 836 
<212> PRT 

<213> Thermus thermophilus 
<400> 8 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 

85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 HO 



Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 

165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 

245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val 

325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 

405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 



Glu Ala Thr Gly Val 
450 

Leu Glu Leu Ala Glu 
465 

Leu Ala Gly His Pro 

485 

Val Leu Phe Asp Glu 
500 

Thr Gly Lys Arg Ser 
515 

Ala His Pro He Val 
530 

Leu Lys Asn Thr Tyr 
545 

Thr Gly Arg Leu His 

565 

Arg Leu Ser Ser Ser 
580 

Pro Leu Gly Gin Arg 
595 

Ala Leu Val Ala Leu 
610 

His Leu Ser Gly Asp 
625 

Asp He His Thr Gin 

645 

Ala Val Asp Pro Leu 
660 

Val Leu Tyr Gly Met 
675 

Pro Tyr Glu Glu Ala 
690 

Pro Lys Val Arg Ala 
705 

Arg Gly Tyr Val Glu 

725 

Leu Asn Ala Arg Val 
740 

Phe Asn Met Pro Val 
755 

Met Val Lys Leu Phe 
770 



Arg Arg Asp Val Ala Tyr 
455 

Glu He Arg Arg Leu Glu 
470 475 

Phe Asn Leu Asn Ser Arg 

490 

Leu Arg Leu Pro Ala Leu 
505 

Thr Ser Ala Ala Val Leu 
520 

Glu Lys He Leu Gin His 
535 

Val Asp Pro Leu Pro Ser 
550 555 

Thr Arg Phe Asn Gin Thr 

570 

Asp Pro Asn Leu Gin Asn 
585 

He Arg Arg Ala Phe Val 
600 

Asp Tyr Ser Gin He Glu 
615 

Glu Asn Leu He Arg Val 
630 635 

Thr Ala Ser Trp Met Phe 

650 

Met Arg Arg Ala Ala Lys 
665 

Ser Ala His Arg Leu Ser 
680 

Val Ala Phe He Glu Arg 
695 

Trp He Glu Lys Thr Leu 
710 715 

Thr Leu Phe Gly Arg Arg 

730 

Lys Ser Val Arg Glu Ala 
745 

Gin Gly Thr Ala Ala Asp 
760 

Pro Arg Leu Arg Glu Met 
775 



Leu Gin Ala Leu Ser 
460 

Glu Glu Val Phe Arg 

480 

Asp Gin Leu Glu Arg 
495 

Gly Lys Thr Gin Lys 
510 

Glu Ala Leu Arg Glu 
525 

Arg Glu Leu Thr Lys 
540 

Leu Val His Pro Arg 

560 

Ala Thr Ala Thr Gly 
575 

He Pro Val Arg Thr 
590 

Ala Glu Ala Gly Trp 
605 

Leu Arg Val Leu Ala 
620 

Phe Gin Glu Gly Lys 

640 

Gly Val Pro Pro Glu 
655 

Thr Val Asn Phe Gly 
670 

Gin Glu Leu Ala He 
685 

Tyr Phe Gin Ser Phe 
700 

Glu Glu Gly Arg Lys 

720 

Arg Tyr Val Pro Asp 
735 

Ala Glu Arg Met Ala 
750 

Leu Met Lys Leu Ala 
765 

Gly Ala Arg Met Leu 
780 



Leu Gin Val His Asp Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
5 805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

10 Ser Ala Lys Gly 

835 



15 


<210> 9 
<211> 2511 














<212> DNA 














20 


<213> Thermus thermophilus 












<400> 9 














2 5:^ 


atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg gcctcaccac 


gagccggggc 


120 




gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 




gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 




tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 




ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 




atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 




cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 




cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


45 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 




gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


50 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta gggccttcct ggagaggctg 


840 




gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


55 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg tcctctcccg ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcagggacg 


gccgggtgca 


ccgggcagca 


1020 




gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg gcctcctcgc 


caaggacctc 


1080 


60 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 




gcctacctcc 


tggacccctc 


caacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 



98 




99 



<212> PRT 

<213> Thermus thermophilus 
<400> 11 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 

85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 

165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 

245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 



100 



Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val 

325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 

405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 

485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 

565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 



101 



Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 

645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arq Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 

725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 

805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 



<210> 12 
<211> 2511 
<212> DNA 

<213> Thermus thermophilus 
<400> 12 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 
tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 
ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 



102 



gcggacgacg ttctcgccac cctggccaag 
atcctcaccg ccgaccgcga cctctaccaa 
5 cccgagggcc acctcatcac cccggagtgg 

cagtgggtgg acttccgcgc cctcgtgggg 
ggcatcgggg agaagaccgc cctcaagctc 

10 

ctcaagaacc tggaccgggt aaagccagaa 
gaagacctca ggctctcctt ggagctctcc 
15 gacctcgccc aggggcggga gcccgaccgg 
gagttcggca gcctcctcca cgagttcggc 
gccccctggc ccccgccgga aggggccttc 

20 

atgtgggcgg agcttaaagc cctggccgcc 
gaccccttgg cggggctaaa ggacctcaag 
25^ gccgtcttgg cctcgaggga ggggctagac 
gcctacctcc tggacccctc caacaccacc 
. 1 gagtggacgg aggacgccgc ccaccgggcc 
% 4 cttaagcgcc tcgaggggga ggagaagctc 
l\l ctctcccggg tcctggccca catggaggcc 
35Q caggcccttt ccctggagct tgcggaggag 
E! ttggcgggcc accccttcaa cctcaactcc 
U gagcttaggc ttcccgcctt ggggaagacg 
f$\ gcggtgctgg aggccctacg ggaggcccac 
gagctcacca agctcaagaa cacctacgtg 
45 acgggccgcc tccacacccg cttcaaccag 

tccgacccca acctgcagaa catccccgtc 
gccttcgtgg ccgaggcggg ttgggcgttg 

50 

cgcgtcctcg cccacctctc cggggacgaa 
gacatccaca cccagaccgc aagctggatg 
55 ctgatgcgcc gggcggccaa gacggtgaac 

aggctctccc aggagcttgc catcccctac 
ttccaaagct tccccaaggt gcgggcctgg 

60 

cggggctacg tggaaaccct cttcggaaga 



aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


ctttgggaga 


agtacggcct 


caggccggag 


540 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gaggggctta gggccttcct 


ggagaggctg 


840 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


tgcagggacg 


gccgggtgca 


ccgggcagca 


1020 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


accggggtac 


ggcgggacgt 


qqcctacctt 


1380 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


caaaagacag gcaagcgctc 


caccagcgcc 


1560 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acggccacgg 


ccacggggag 


qcttagtagc 


1740 


cgcaccccct 


tgggccagag 


qatccgccgg 


1800 


gtggccctgg 


actatagcca 


qatagagctc 

ZJ -J —J 


1860 


aacctgatca 


gggtcttcca 


gg a gggg aa g 


1920 


ttcggcgtcc 


ccccggaggc 


cqtggacccc 


1980 


ttcggcgtcc 


tctacggcat 


guccgcccai- 


A U *± U 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


aggcgctacg tgcccgacct 


caacgcccgg 


2220 



103 





qtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 




gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg ggagatgggg 


2340 


5 


gcccgcatgc 


tcctccaggt 


ccacaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 




gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


10 


ctggaggtgg 
<210> 13 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtta 


g 


2511 


15 


<211> 58 
<212> DNA 
















<213> synthetic 












20 


<400> 13 
















tgcctgcagg 


tcgacgctag 


ctagtggtgg 


tggtggtggt 


gacccttggc 


ggaaagcc 


58 


25-.*. 


<210> 14 
















<211> 2526 












3c Ci 


<212> DNA 

<213> Thermus thermophilic 












<400> 14 
















atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 




ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg gcctcaccac 


gagccggggc 


120 




gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 




tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


A R 

■4: O 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 




gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


50 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


cacrqccqqaq 


540 




cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


55 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 

U U w 




ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


60 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 



104 





gagttcggca gcctcctcca 


prrarrt" i~ (Tine* 

oydy l. Ltyyo 




gccccctggc 


ccccgccgga 


ci.y y y y o <^ u u ^ 


r- 

5 


atgtgggcgg 


agcttaaagc 


r 1 1 - aa c c a cc 




gaccccttgg 


cggggctaaa 


yyd^u u cicxy 


10 


gccgtcttgg 


cctcgaggga 


y y y y y 


gcctacctcc 


tggacccctc 






gagtggacgg 


aggacgccgc 




15 


cttaagcgcc 


tcgaggggga 


y y a y ^^-y 




ctctcccggg 


tcctggccca 


CatyydyyLL 


20 


caggcccttt 


ccctggagct 


tycyydyyciy 


ttggcgggcc 


accccttcaa 






gagcttaggc 


ttcccgcctt 


y y y y ^ ^y ^ ° y 


2E^ 


gcggtgctgg 


aggccctacg 


g y ay y c- t^o a. u 




gagctcacca 


agctcaagaa 


CaCC Lacyug 


3 0,/: 


aCQQQCCQCC 


tccacacccg 


CEucaaccag 


tccgacccca 


acctgcagaa 


catzccccgcc 




gccttcgtgg 


ccgaggcggg 


^ t 999 c 9 T: tg 


3 5 - " 


cgcgtcctcg 


cccacctctc 


c yyyy ac y aa 




gacatccaca 


cccagaccgc 


aayctyyaLy 


ii " 


ctgatgcgcc 


gggcggccaa 


ycn_-yy l. y dcto 


aggctctccc 


aggagcttgc 


CatCCCCCaC 




ttccaaagct 


tccccaaggt 


9 c 999 ccc yy 


45 


cggggctacg 


tggaaaccct 


4— -f— r~*</—*r — i " 1 ^ 

Cttuyyadya 




gtgaagagcg 


tcagggaggc 


C y c y9 d -y c -y u 


50 


gccgccgacc 


tcatgaagct 


cgccatggtg 


gcccgcatgc 


tcctccaggt 


ccacaacgag 




gaggaggtgg 


cggctttggc 


caaggaggcc 


55 


ctggaggtgg 
caccac 


aggtggggat 


gggggaggac 



60 <210> 15 

<211> 842 



ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


tgcagggacg gccgggtgca 


ccgggcagca 


1020 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


ctcctctcgg 


agaggctcca 


tcgqaacctc 


1260 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


atccgccgcc 


tcgaggagga 


qqtcttccgc 


1440 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


caaaagacag 


gcaagcgctc 


caccagcgcc 


1560 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


ttcggcgtcc 


ccccggaggc 


cqtgqacccc 


1980 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


gaggaggcgg 


tggcctttat 


aqaqcqctac 


2100 


atagaaaaga 


ccctggagga 


qqqgaggaag 


2160 


aggcgctacg 


tgcccgacct 


caacqcccqq 


2220 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 
2526 
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<212> PRT 

<213> Thermus thermophilus 
<400> 15 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 

85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 

165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 

245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 
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Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val 

325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 

405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 

485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 

565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 
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Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 

645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arq Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 

725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 

805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 


16 


<211> 


31 


<212> 


DNA 


<213> 


synthetic 


<400> 


16 


gcctgcaggg gcggccgcgt 


<210> 


17 


<211> 


26 


<212> 


DNA 


<213> 


synthetic 


<400> 


17 
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ctcctggacc cttcgaacac cacccc 



<210> 18 

<211> 23 

<212> DNA 

<213> synthetic 

<400> 18 



gtcctggccc 


atatggaggc 


cac 


<210> 19 






<211> 2526 




<212> DNA 






<213> Thermus thermophilus 


V *± \J U <- > -1 ' 








3 crcr r cr a fc cr c fc 


tCCQCtCttt 

V—^ W 1 w 




1~ cfcrpr't" area 


caccttcttc 


gaaccggtgc 


aggeggtcta 


cggcttcgcc 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


tacgaggcct 


acaaggeggg 


gagggccccg 


ctcatcaagg 


agctggtgga 


cctcctgggg 


gcggacgacg 


ttctcgccac 


cctggccaag 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


cccgagggcc 


acctcatcac 


cccggagtgg 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


ggcatcgggg 


agaagacege 


cctcaagctc 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


gaagacctca 


ggctctcctt 


ggagctctcc 


gacctcgccc 


aggggeggga 


gcccgaccgg 


gagttcggca 


gcctcctcca 


egagttegge 


gccccctggc 


ccccgccgga 


aggggectte 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


gaccccttgg 


eggggctaaa 


ggacctcaag 



gaacccaaag gccgggtcct cctggtggac 
gecctgaagg gcctcaccac gagcegggge 
aagagcctcc tcaaggccct gaaggaggac 
gccaaggccc cctccttccg ccacgaggcc 
acccccgagg acttcccccg gcagctcgcc 
tttacccgcc tcgaggtccc eggctacgag 
aaggcggaaa aggaggggta egaggtgege 
ctcgtctccg accgcgtcgc cgtcctccac 
ctttgggaga agtaeggect caggceggag 
gacccctccg acaacctccc eggggtcaag 
ctcaaggagt ggggaagect ggaaaacctc 
aacgtccggg agaagatcaa ggcccacctg 
egggtgegea ccgacctccc cctggaggtg 
gaggggctta gggccttcct ggagaggctg 
ctcctggagg cccccgcccc cctggaggag 
gtgggcttcg tcctctcccg ccccgagccc 
tgcaggggcg gccgcgtgca ccgggcagca 
gaggtceggg gcctcctcgc caaggacctc 
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gccgtcttgg cctcgaggga ggggctagac 
gcctacctcc tggacccttc gaacaccacc 
gagtggacgg aggacgccgc ccaccgggcc 
cttaagcgcc tcgaggggga ggagaagctc 
ctctcccggg tcctggccca tatggaggcc 
caggcccttt ccctggagct tgcggaggag 
ttggcgggcc accccttcaa cctcaactcc 
gagcttaggc ttcccgcctt ggggaagacg 
gcggtgctgg aggccctacg ggaggcccac 
gagctcacca agctcaagaa cacctacgtg 
acgggccgcc tccacacccg cttcaaccag 
tccgacccca acctgcagaa catccccgtc 
gccttcgtgg ccgaggcggg ttgggcgttg 
cgcgtcctcg cccacctctc cggggacgaa 
gacatccaca cccagaccgc aagctggatg 
ctgatgcgcc gggcggccaa gacggtgaac 
aggctctccc aggagcttgc catcccctac 
ttccaaagct tccccaaggt gcgggcctgg 
cggggctacg tggaaaccct cttcggaaga 
gtgaagagcg tcagggaggc cgcggagcgc 
gccgccgacc tcatgaagct cgccatggtg 
gcccgcatgc tcctccaggt ccacaacgag 
gaggaggtgg cggctttggc caaggaggcc 
ctggaggtgg aggtggggat gggggaggac 
caccac 

<210> 20 
<211> 842 
<212> PRT 

<213> Thermus thermophilus 
<400> 20 

Met Asn Ser Glu Ala Met Leu Pro 



ctcgtgcccg gggacgaccc catgctcctc 
cccgaggggg tggcgcggcg ctacgggggg 
ctcctctcgg agaggctcca tcggaacctc 
ctttggctct accacgaggt ggaaaagccc 
accggggtac ggcgggacgt ggcctacctt 
atccgccgcc tcgaggagga ggtcttccgc 
cgggaccagc tggaaagggt gctctttgac 
caaaagacag gcaagcgctc caccagcgcc 
cccatcgtgg agaagatcct ccagcaccgg 
gaccccctcc caagcctcgt ccacccgagg 
acggccacgg ccacggggag gcttagtagc 
cgcaccccct tgggccagag gatccgccgg 
gtggccctgg actatagcca gatagagctc 
aacctgatca gggtcttcca ggaggggaag 
ttcggcgtcc ccccggaggc cgtggacccc 
ttcggcgtcc tctacggcat gtccgcccat 
gaggaggcgg tggcctttat agagcgctac 
atagaaaaga ccctggagga ggggaggaag 
aggcgctacg tgcccgacct caacgcccgg 
atggccttca acatgcccgt ccagggcacc 
aagctcttcc cccgcctccg ggagatgggg 
ctcctcctgg aggcccccca agcgcgggcc 
atggagaagg cctatcccct cgccgtgccc 
tggctttccg ccaagggtca ccaccaccac 



Leu Phe Glu Pro Lys Gly Arg Val 
110 



15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 

85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 

165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 

245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 

325 330 335 
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His Arg Ala Ala 
340 

Arg Gly Leu Leu 
355 

Leu Asp Leu Val 
370 

Asp Pro Ser Asn 
385 

Glu Trp Thr Glu 



His Arg Asn Leu 
420 

Leu Tyr His Glu 
435 

Glu Ala Thr Gly 
450 

Leu Glu Leu Ala 
465 

Leu Ala Gly His 



Val Leu Phe Asp 
500 

Thr Gly Lys Arg 
515 

Ala His Pro He 
530 

Leu Lys Asn Thr 
545 

Thr Gly Arg Leu 



Arg Leu Ser Ser 
580 

Pro Leu Gly Gin 
595 

Ala Leu Val Ala 
610 

His Leu Ser Gly 
625 

Asp He His Thr 



Ala Val Asp Pro 
660 



Asp Pro Leu Ala 



Ala Lys Asp Leu 
360 

Pro Gly Asp Asp 
375 

Thr Thr Pro Glu 
390 

Asp Ala Ala His 
405 

Leu Lys Arg Leu 



Val Glu Lys Pro 
440 

Val Arg Arg Asp 
455 

Glu Glu He Arg 
470 

Pro Phe Asn Leu 
485 

Glu Leu Arg Leu 



Ser Thr Ser Ala 
520 

Val Glu Lys He 
535 

Tyr Val Asp Pro 
550 

His Thr Arg Phe 
565 

Ser Asp Pro Asn 



Arg He Arg Arg 
600 

Leu Asp Tyr Ser 
615 

Asp Glu Asn Leu 
630 

Gin Thr Ala Ser 
645 

Leu Met Arg Arg 



Gly Leu Lys Asp 
345 

Ala Val Leu Ala 



Pro Met Leu Leu 
380 

Gly Val Ala Arg 
395 

Arg Ala Leu Leu 
410 

Glu Gly Glu Glu 
425 

Leu Ser Arg Val 



Val Ala Tyr Leu 
460 

Arg Leu Glu Glu 
475 

Asn Ser Arg Asp 
490 

Pro Ala Leu Gly 
505 

Ala Val Leu Glu 



Leu Gin His Arg 
540 

Leu Pro Ser Leu 
555 

Asn Gin Thr Ala 
570 

Leu Gin Asn He 
585 

Ala Phe Val Ala 



Gin He Glu Leu 
620 

He Arg Val Phe 
635 

Trp Met Phe Gly 
650 

Ala Ala Lys Thr 
665 



Leu Lys Glu Val 
350 

Ser Arg Glu Gly 
365 

Ala Tyr Leu Leu 



Arg Tyr Gly Gly 
400 

Ser Glu Arg Leu 
415 

Lys Leu Leu Trp 
430 

Leu Ala His Met 
445 

Gin Ala Leu Ser 



Glu Val Phe Arg 
480 

Gin Leu Glu Arg 
495 

Lys Thr Gin Lys 
510 

Ala Leu Arg Glu 
525 

Glu Leu Thr Lys 



Val His Pro Arg 
560 

Thr Ala Thr Gly 
575 

Pro Val Arg Thr 
590 

Glu Ala Gly Trp 
605 

Arg Val Leu Ala 



Gin Glu Gly Lys 
640 

Val Pro Pro Glu 
655 

Val Asn Phe Gly 
670 
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Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arq Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 

725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 

805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 21 
<211> 30 
<212> DNA 
<213> synthetic 
<400> 21 

caggaggagc tcgttggcga cctggaggag 

<210> 22 
<211> 2526 
<212> DNA 

<213> Thermus thermophilus 
<400> 22 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 
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gggtacaagg ccgtcttcgt ggtctttgac 
tacgaggcct acaaggcggg gagggccccg 
5 ctcatcaagg agctggtgga cctcctgggg 

gcggacgacg ttctcgccac cctggccaag 
atcctcaccg ccgaccgcga cctctaccaa 

10 

cccgagggcc acctcatcac cccggagtgg 
cagtgggtgg acttccgcgc cctcgtgggg 
15 ggcatcgggg agaagaccgc cctcaagctc 
ctcaagaacc tggaccgggt aaagccagaa 
gaagacctca ggctctcctt ggagctctcc 

20 

gacctcgccc aggggcggga gcccgaccgg 
gagttcggca gcctcctcca cgagttcggc 
2%&: gccccctggc ccccgccgga aggggccttc 
k 3 atgtgggcgg agcttaaagc cctggccgcc 
gaccccttgg cggggctaaa ggacctcaag 
'2 gccgtcttgg cctcgaggga ggggctagac 
Ly gcctacctcc tggacccttc gaacaccacc 
3£^ gagtggacgg aggacgccgc ccaccgggcc 
O cttaagcgcc tcgaggggga ggagaagctc 
p5 ctctcccggg tcctggccca tatggaggcc 

40:;; 

^ caggcccttt ccctggagct tgcggaggag 
L-jj. ttggcgggcc accccttcaa cctcaactcc 
45 gagcttaggc ttcccgcctt ggggaagacg 

gcggtgctgg aggccctacg ggaggcccac 
gagctcacca agctcaagaa cacctacgtg 

50 

acgggccgcc tccacacccg cttcaaccag 
tccgacccca acctgcagaa catccccgtc 
55 gccttcgtgg ccgaggcggg ttgggcgttg 
cgcgtcctcg cccacctctc cggggacgaa 
gacatccaca cccagaccgc aagctggatg 

60 

ctgatgcgcc gggcggccaa gacggtgaac 



gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


ctcqtctccg 


accgcgtcgc 


cgtcctccac 


480 


ctttqqgaga 


agtacggcct 


caggccggag 


540 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gaggggctta gggccttcct 


ggagaggctg 


840 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


tgcaggggcg gccgcgtgca 


ccgggcagca 


1020 


gaggtccggg 


gcctcctcgc 


caaqqacctc 


1080 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


accggggtac 


ggcgggacgt 


qgcctacctt 


1380 


atccgccgcc 


tcgaggagga 


qqtCttCCgC 


1440 


cgggaccagc 


tggaaagggt 


qctctttgac 


1500 


caaaagacag gcaagcgctc 


caccagcgcc 


1560 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acggccacgg 


ccacggggag gcttagtagc 


1740 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gtggccctgg 


actatagcca 


gatagagctc 


T 0 ez A 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 
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Leu Arg Pro Glu 
180 

Ser Asp Asn Leu 
195 

Lys Leu Leu Lys 
210 

Asp Arg Val Lys 
225 

Glu Asp Leu Arg 



Pro Leu Glu Val 
260 

Leu Arg Ala Phe 
275 

Phe Gly Leu Leu 
290 

Pro Pro Glu Gly 
305 

Met Trp Ala Glu 



His Arg Ala Ala 
340 

Arg Gly Leu Leu 
355 

Leu Asp Leu Val 
370 

Asp Pro Ser Asn 
385 

Glu Trp Thr Glu 



His Arg Asn Leu 
420 

Leu Tyr His Glu 
435 

Glu Ala Thr Gly 
450 

Leu Glu Leu Ala 
465 

Leu Ala Gly His 



Val Leu Phe Asp 
500 



Gin Trp Val Asp 



Pro Gly Val Lys 
200 

Glu Trp Gly Ser 
215 

Pro Glu Asn Val 
230 

Leu Ser Leu Glu 
245 

Asp Leu Ala Gin 



Leu Glu Arg Leu 
280 

Glu Ala Pro Ala 
295 

Ala Phe Val Gly 
310 

Leu Lys Ala Leu 
325 

Asp Pro Leu Ala 



Ala Lys Asp Leu 
360 

Pro Gly Asp Asp 
375 

Thr Thr Pro Glu 
390 

Asp Ala Ala His 
405 

Leu Lys Arg Leu 



Val Glu Lys Pro 
440 

Val Arg Arg Asp 
455 

Glu Glu He Arg 
470 

Pro Phe Asn Leu 
485 

Glu Leu Arg Leu 



Phe Arg Ala Leu 
185 

Gly He Gly Glu 



Leu Glu Asn Leu 
220 

Arg Glu Lys He 
235 

Leu Ser Arg Val 
250 

Gly Arg Glu Pro 
265 

Glu Phe Gly Ser 



Pro Leu Glu Glu 
300 

Phe Val Leu Ser 
315 

Ala Ala Cys Arg 
330 

Gly Leu Lys Asp 
345 

Ala Val Leu Ala 



Pro Met Leu Leu 
380 

Gly Val Ala Arg 
395 

Arg Ala Leu Leu 
410 

Glu Gly Glu Glu 
425 

Leu Ser Arg Val 



Val Ala Tyr Leu 
460 

Arg Leu Glu Glu 
475 

Asn Ser Arg Asp 
490 

Pro Ala Leu Gly 
505 



Val Gly Asp Pro 
190 

Lys Thr Ala Leu 
205 

Leu Lys Asn Leu 



Lys Ala His Leu 
240 

Arg Thr Asp Leu 
255 

Asp Arg Glu Gly 
270 

Leu Leu His Glu 
285 

Ala Pro Trp Pro 



Arg Pro Glu Pro 
320 

Gly Gly Arg Val 
335 

Leu Lys Glu Val 
350 

Ser Arg Glu Gly 
365 

Ala Tyr Leu Leu 



Arg Tyr Gly Gly 
400 

Ser Glu Arg Leu 
415 

Lys Leu Leu Trp 
430 

Leu Ala His Met 
445 

Gin Ala Leu Ser 



Glu Val Phe Arg 
480 

Gin Leu Glu Arg 
495 

Lys Thr Gin Lys 
510 
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Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 

565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 

645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 

725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 

805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 
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<210> 24 

<211> 30 

5 

<212> DNA 

<213> synthetic 

10 <400> 24 

caggaggagc tcgttggcga cctggaggag 

15 <210> 25 

<211> 2526 

<212> DNA 

20 

<213> Thermus thermophilus 

<400> 25 



2S^ 


atgaattccg 


aggcgacgcu 




y cLd\^ k> ciCT.a.y 


ac pcracrt c c fc 

>-H ^ *— ^ ^— ' 


c c t crcr t crcrac 


60 




ggccaccacc 


tggcctaccg 


cacctxcuxc 


^-v- /-i +- /-T a a t~vrt 

gccctyddyy 






12 0 


3Q/1 


gaaccggtgc 


aggcggtcta 


cggcu ucgcc 


aagay cc ucl 




era acrcfacrcra.G 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


ii ■• s 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


3^ 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


eggctacgag 


360 




gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


egaggtgege 


420 


ii ;:=s s 

4 tiff 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggceggag 


540 




cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


eggggtcaag 


600 


45 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 




ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


50 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 




gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


55 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 




atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


60 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg gggacgaccc 


catgctcctc 


1140 
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err rt*acctcc 


tacracccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


etaegggggg 


1200 




aaataaacQQ 


aqqacqccqc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 




cttaacrcQCC 


tcqaqqqqqa 


qgagaagctc 


ctttggctct 


accacgaggt 


ggaaaagece 


1320 




ctctcccggg 


tcctggccca 


tatggaggee 


aceggggtae 


ggcgggacgt 


ggcctacctt 


1380 


10 


caaqcccttt 


ccctggagct 


tgeggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


1 1 era c Qcrcr c c 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 




aa err 1 1" 1" aacrc 


ttcccgcctt 


qaaqaaqacg 


aagaagacag gcaagcgctc 


caccagcgcc 


1560 




crraatcrctcrcr 


aaaccctacq 


qqaqqcccac 


cccatcgtgg agaagatcct 


ccagcaccgg 


1620 




cracrct caeca. 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


20 


accrcrcrccQCC 


tccacacccg 


cttcaaccag 


acggccacgg 


ecaeggggag 


gcttagtagc 


1740 




acctacaqaa 


catccccgtc 


cgcaccccct 


tgggecagag 


gatccgccgg 


1800 




a p p t" t ca t aa 


c CQaqqcaqq 


ttgggcgttg gtggccctgg actatageca gatagagctc 


1860 


9 R 


pcrpcrt - pp 1~ ccr 


cccacctctc 


eggggacgaa 


aacctgatca gggtcttcca 


ggaggggaag 


1920 




y Ct.\^CL L- w >w CI. \— ' 


cccacraccqc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


3 or\ 


p t* era t" a c crc c 


aaoccrcTC c aa 


gaeggtgaac 


ttcggcgtcc 


tetaeggcat 


gtccgcccat 


2040 


a or crp trtccc 


aaaaqcttqc 


catcccctac 


gaggaggegg 


tggectttat 


agagegctae 


2100 




t~t" PPaaaCfCt 


tccccaaqqt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 




pcrcrcrcrcfcaccr 

^yyyy uLa ^y 


taqaaaccct 


etteggaaga 


aggegctacg 


tgcccgacct 


caacgcccgg 


2220 




crt~ aaaaacrcci 


t caqqqaqqc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 




gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


O A C\ C\ 

z *± u u 




gaggaggtgg 


eggctttgge 


caaggaggee 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


45 


ctggaggtgg 
caccac 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 
2526 



50 <210> 26 

<211> 842 
<212> PRT 

55 

<213> Thermus thermophilics 
<400> 26 

60 Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 

15 10 15 
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Leu Leu Val Asp 
20 

Lys Gly Leu Thr 
35 

Phe Ala Lys Ser 
50 

Val Phe Val Val 
65 

Tyr Glu Ala Tyr 



Arg Gin Leu Ala 
100 

Arg Leu Glu Val 
115 

Ala Lys Lys Ala 
130 

Asp Arg Asp Leu 
145 

Pro Glu Gly His 



Leu Arg Pro Glu 
180 

Ser Asp Asn Leu 
195 

Lys Leu Leu Lys 
210 

Asp Arg Val Lys 
225 

Glu Asp Leu Arg 



Pro Leu Glu Val 
260 

Leu Arg Ala Phe 
275 

Phe Gly Leu Leu 
290 

Pro Pro Glu Gly 
305 

Met Trp Ala Glu 



His Arg Ala Ala 
340 



Gly His His Leu 



Thr Ser Arg Gly 
40 

Leu Leu Lys Ala 
55 

Phe Asp Ala Lys 
70 

Lys Ala Gly Arg 
85 

Leu lie Lys Glu 



Pro Gly Tyr Glu 
120 

Glu Lys Glu Gly 
135 

Tyr Gin Leu Val 
150 

Leu lie Thr Pro 
165 

Gin Trp Val Asp 



Pro Gly Val Lys 
200 

Glu Trp Gly Ser 
215 

Pro Glu Asn Val 
230 

Leu Ser Leu Glu 
245 

Asp Leu Ala Gin 



Leu Glu Arg Leu 
280 

Glu Ala Pro Ala 
295 

Ala Phe Val Gly 
310 

Leu Lys Ala Leu 
325 

Asp Pro Leu Ala 



Ala Tyr Arg Thr 
25 

Glu Pro Val Gin 



Leu Lys Glu Asp 
60 

Ala Pro Ser Phe 
75 

Ala Pro Thr Pro 
90 

Leu Val Asp Leu 
105 

Ala Asp Asp Val 



Tyr Glu Val Arg 
140 

Ser Asp Arg Val 
155 

Glu Trp Leu Trp 
170 

Phe Arg Ala Leu 
185 

Gly He Gly Glu 



Leu Glu Asn Leu 
220 

Arg Glu Lys He 
235 

Leu Ser Arg Val 
250 

Gly Arg Glu Pro 
265 

Glu Phe Gly Ser 



Pro Leu Glu Glu 
300 

Phe Val Leu Ser 
315 

Ala Ala Cys Arg 
330 

Gly Leu Lys Asp 
345 



Phe Phe Ala Leu 
30 

Ala Val Tyr Gly 
45 

Gly Tyr Lys Ala 



Arg His Glu Ala 
80 

Glu Asp Phe Pro 
95 

Leu Gly Phe Thr 
110 

Leu Ala Thr Leu 
125 

He Leu Thr Ala 



Ala Val Leu His 
160 

Glu Lys Tyr Gly 
175 

Val Gly Asp Pro 
190 

Lys Thr Ala Leu 
205 

Leu Lys Asn Leu 



Lys Ala His Leu 
240 

Arg Thr Asp Leu 
255 

Asp Arg Glu Gly 
270 

Leu Leu His Glu 
285 

Ala Pro Trp Pro 



Arg Pro Glu Pro 
320 

Gly Gly Arg Val 
335 

Leu Lys Glu Val 
350 



120 



Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 

405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 

485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 

565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 

645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 
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Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 

725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 

805 810 815 





820 








825 




830 




Ser Ala Lys 
835 


Gly 


His 


His 


His His 
840 


His His 








<210> 27 


















<211> 340 


















<212> PRT 


















<213> Pyrococcus furiosus 










<400> 27 


















Met Gly Val 
1 


Pro 


He 
5 


Gly 


Glu He 


He Pro 
10 


Arg Lys Glu 


He 


Glu 
15 


Glu Asn Leu 


Tyr 
20 


Gly 


Lys 


Lys He 


Ala He 
25 


Asp Ala Leu 


Asn 
30 


Ala 


Tyr Gin Phe 
35 


Leu 


Ser 


Thr 


He Arg Gin Lys Asp Gly Thr 
40 45 


Pro 


Leu 


Asp Ser Lys 
50 


Gly 


Arg 


He 


Thr Ser 
55 


His Leu 


Ser Gly Leu 
60 


Phe 


Tyr 


Thr He Asn 
65 


Leu 


Met 


Glu 
70 


Ala Gly 


He Lys 


Pro Val Tyr 
75 


Val 


Phe 


Gly Glu Pro 


Pro 


Glu 
85 


Phe 


Lys Lys 


Lys Glu 
90 


Leu Glu Lys 


Arg Arg 
95 


Ala Arg Glu 


Glu 
100 


Ala 


Glu 


Glu Lys 


Trp Arg 
105 


Glu Ala Leu 


Glu 
110 


Lys 



80 
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Glu He Glu Glu Ala Arg Lys Tyr Ala Gin Arg Ala Thr Arg Val Asn 
115 120 125 

Glu Met Leu He Glu Asp Ala Lys Lys Leu Leu Glu Leu Met Gly He 
130 135 140 

Pro He Val Gin Ala Pro Ser Glu Gly Glu Ala Gin Ala Ala Tyr Met 
145 150 155 160 

Ala Ala Lys Gly Ser Val Tyr Ala Ser Ala Ser Gin Asp Tyr Asp Ser 

165 170 175 

Leu Leu Phe Gly Ala Pro Arg Leu Val Arg Asn Leu Thr He Thr Gly 
180 185 190 

Lys Arg Lys Leu Pro Gly Lys Asn Val Tyr Val Glu He Lys Pro Glu 
195 200 205 

Leu He He Leu Glu Glu Val Leu Lys Glu Leu Lys Leu Thr Arg Glu 
210 215 220 

Lys Leu He Glu Leu Ala He Leu Val Gly Thr Asp Tyr Asn Pro Gly 
225 230 235 240 

Gly He Lys Gly He Gly Leu Lys Lys Ala Leu Glu He Val Arg His 

245 250 255 

Ser Lys Asp Pro Leu Ala Lys Phe Gin Lys Gin Ser Asp Val Asp Leu 
260 265 270 

Tyr Ala He Lys Glu Phe Phe Leu Asn Pro Pro Val Thr Asp Asn Tyr 
275 280 285 

Asn Leu Val Trp Arg Asp Pro Asp Glu Glu Gly He Leu Lys Phe Leu 
290 295 300 

Cys Asp Glu His Asp Phe Ser Glu Glu Arg Val Lys Asn Gly Leu Glu 
305 310 315 320 

Arg Leu Lys Lys Ala He Lys Ser Gly Lys Gin Ser Thr Leu Glu Ser 

325 330 335 

Trp Phe Lys Arg 
340 

<210> 28 
<211> 326 
<212> PRT 

<213> Methanococcus jannaschii 
<400> 28 

Met Gly Val Gin Phe Gly Asp Phe He Pro Lys Asn He He Ser Phe 
15 10 15 

Glu Asp Leu Lys Gly Lys Lys Val Ala He Asp Gly Met Asn Ala Leu 
20 25 30 

Tyr Gin Phe Leu Thr Ser He Arg Leu Arg Asp Gly Ser Pro Leu Arg 
35 40 45 
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Asn Arg Lys Gly Glu He Thr Ser Ala Tyr Asn Gly Val Phe Tyr Lys 
50 55 60 

Thr He His Leu Leu Glu Asn Asp He Thr Pro He Trp Val Phe Asp 
65 70 75 80 

Gly Glu Pro Pro Lys Leu Lys Glu Lys Thr Arg Lys Val Arg Arg Glu 

85 90 95 

Met Lys Glu Lys Ala Glu Leu Lys Met Lys Glu Ala He Lys Lys Glu 
100 105 110 

Asp Phe Glu Glu Ala Ala Lys Tyr Ala Lys Arg Val Ser Tyr Leu Thr 
115 120 125 

Pro Lys Met Val Glu Asn Cys Lys Tyr Leu Leu Ser Leu Met Gly He 
130 135 140 

Pro Tyr Val Glu Ala Pro Ser Glu Gly Glu Ala Gin Ala Ser Tyr Met 
145 150 155 160 

Ala Lys Lys Gly Asp Val Trp Ala Val Val Ser Gin Asp Tyr Asp Ala 

165 170 175 

Leu Leu Tyr Gly Ala Pro Arg Val Val Arg Asn Leu Thr Thr Thr Lys 
180 185 190 

Glu Met Pro Glu Leu He Glu Leu Asn Glu Val Leu Glu Asp Leu Arg 
195 200 205 

He Ser Leu Asp Asp Leu He Asp He Ala He Phe Met Gly Thr Asp 
210 215 220 

Tyr Asn Pro Gly Gly Val Lys Gly He Gly Phe Lys Arg Ala Tyr Glu 
225 230 235 240 

Leu Val Arg Ser Gly Val Ala Lys Asp Val Leu Lys Lys Glu Val Glu 

245 250 255 

Tyr Tyr Asp Glu He Lys Arg He Phe Lys Glu Pro Lys Val Thr Asp 
260 265 270 

Asn Tyr Ser Leu Ser Leu Lys Leu Pro Asp Lys Glu Gly He He Lys 
275 280 285 

Phe Leu Val Asp Glu Asn Asp Phe Asn Tyr Asp Arg Val Lys Lys His 
290 295 300 

Val Asp Lys Leu Tyr Asn Leu He Ala Asn Lys Thr Lys Gin Lys Thr 
305 310 315 320 

Leu Asp Ala Trp Phe Lys 



325 



<210> 



29 



<211> 



328 



<212> 



PRT 



<213> 



Methanobacterium thermoautotrophicum 
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<400> 29 

Met Gly Val Lys Leu Arg Asp Val Val Ser Pro Arg Arg He Arg Leu 
15 10 15 

Glu Asp Leu Arg Gly Arg Thr Val Ala Val Asp Ala Ala Asn Thr Leu 
20 25 30 

Tyr Gin Phe Leu Ser Ser He Arg Gin Arg Asp Gly Thr Pro Leu Met 
35 40 45 

Asp Ser Arg Gly Arg Val Thr Ser His Leu Ser Gly He Leu Tyr Arg 
50 55 60 

Thr Ala Ala Val Met Glu Arg Glu He Arg Val He Tyr Val Phe Asp 
65 70 75 80 

Gly Arg Ser His His Leu Lys Gly Glu Thr Val Ser Arg Arg Ala Asp 

85 90 95 

He Arg Lys Lys Ser Glu Val Glu Trp Lys Arg Ala Leu Glu Glu Gly 
100 105 110 

Asp He Asp Arg Ala Arg Lys Tyr Ala Val Arg Ser Ser Arg Met Ser 
115 120 125 

Ser Glu He Leu Glu Ser Ser Lys Arg Leu Leu Glu Leu Leu Gly He 
130 135 140 

Pro Tyr Val Gin Ala Pro Gly Glu Gly Glu Ala Gin Ala Ser Tyr Met 
145 150 155 160 

Val Lys Met Gly Asp Ala Trp Ala Val Ala Ser Gin Asp Tyr Asp Cys 

165 170 175 

Leu Leu Phe Gly Ala Pro Arg Val Val Arg Lys Val Thr Leu Ser Gly 
180 185 190 

Lys Leu Glu Asp Pro His He He Glu Leu Glu Ser Thr Leu Arg Ala 
195 200 205 

Leu Ser He Ser His Thr Gin Leu Val Asp Met Ala Leu Leu Val Gly 
210 215 220 

Thr Asp Phe Asn Glu Gly Val Lys Gly Tyr Gly Ala Arg Arg Gly Leu 
225 230 235 240 

Lys Leu He Arg Glu Lys Gly Asp He Phe Lys Val He Arg Asp Leu 

245 250 255 

Glu Ala Asp He Gly Gly Asp Pro Gin Val Leu Arg Arg He Phe Leu 
260 265 270 

Glu Pro Glu Val Ser Glu Asp Tyr Glu He Arg Trp Arg Lys Pro Asp 
275 280 285 

Val Glu Gly Val He Glu Phe Leu Cys Thr Glu His Gly Phe Ser Glu 
290 295 300 

Asp Arg Val Arg Asp Ala Leu Lys Lys Phe Glu Gly Ala Ser Ser Thr 
305 310 315 320 



125 



Gin Lys Ser Leu Glu Asp Trp Phe 

325 



<210> 


30 


<211> 


336 


<212> 


PRT 


<213> 


Afu 


<400> 


30 



Met Gly Ala Asp 
1 

Glu Tyr Phe Ser 
20 

Tyr Gin Phe lie 
35 

Asp Ser Gin Gly 
50 

Val Ser Asn Met 
65 

Gly Glu Pro Pro 



Arg Arg Ala Glu 
100 

Asp Lys Asp Ala 
115 

Tyr lie Val Asp 
130 

Phe Val Asp Ala 
145 

Ala Lys Gly Asp 



Leu Phe Gly Ser 
180 

Arg Lys Leu Pro 
195 

He He Leu Glu 
210 

Leu He Asp He 
225 

Lys Gly Val Gly 



He Gly Asp Leu 
5 

Gly Lys Lys He 



Ser He He Arg 
40 

Arg He Thr Ser 
55 

Val Glu Val Gly 
70 

Glu Phe Lys Lys 
85 

Ala Glu Glu Met 



Lys Lys Tyr Ala 
120 

Ser Ala Lys Thr 
135 

Pro Ser Glu Gly 
150 

Val Glu Tyr Thr 
165 

Pro Arg Leu Ala 



Gly Lys Asn Val 
200 

Ser Asn Leu Lys 
215 

Ala He Leu Val 
230 

Val Lys Lys Ala 



Phe Glu Arg Glu 
10 

Ala Val Asp Ala 
25 

Gin Pro Asp Gly 



His Leu Ser Gly 
60 

He Arg Pro Val 
75 

Ala Glu He Glu 
90 

Trp He Ala Ala 
105 

Gin Ala Ala Gly 



Leu Leu Ser Tyr 
140 

Glu Ala Gin Ala 
155 

Gly Ser Gin Asp 
170 

Arg Asn Leu Ala 
185 

Tyr Val Asp Val 



Arg Leu Gly Leu 
220 

Gly Thr Asp Tyr 
235 

Leu Asn Tyr He 



Glu Val Glu Leu 
15 

Phe Asn Thr Leu 
30 

Thr Pro Leu Lys 
45 

He Leu Tyr Arg 



Phe Val Phe Asp 
80 

Glu Arg Lys Lys 
95 

Leu Gin Ala Gly 
110 

Arg Val Asp Glu 
125 

Met Gly He Pro 



Ala Tyr Met Ala 
160 

Tyr Asp Ser Leu 
175 

He Thr Gly Lys 
190 

Lys Pro Glu He 
205 

Thr Arg Glu Gin 



Asn Glu Gly Val 
240 

Lys Thr Tyr Gly 



126 



245 



250 



255 



10 



15 



Asp lie Phe Arg Ala Leu Lys Ala Leu Lys Val Asn lie Asp His Val 
260 265 270 

Glu Glu lie Arg Asn Phe Phe Leu Asn Pro Pro Val Thr Asp Asp Tyr 
275 280 285 

Arg He Glu Phe Arg Glu Pro Asp Phe Glu Lys Ala He Glu Phe Leu 
290 295 300 

Cys Glu Glu His Asp Phe Ser Arg Glu Arg Val Glu Lys Ala Leu Glu 
305 310 315 320 

Lys Leu Lys Ala Leu Lys Ser Thr Gin Ala Thr Leu Glu Arg Trp Phe 

325 330 335 



20 



2 5 :: „ 



3 0 ! 



35 



3 



45 



50 



<210> 31 
<211> 27 
<212> DNA 
<213> synthetic 

<220><221> misc_feature<222> (17) 
<220><221> misc_feature<222> (27) 
<400> 31 

atctctagca ctgctgtntt ygayggn 

<210> 32 
<211> 31 
<212> DNA 
<213> synthetic 

<220><221> misc_f eature<222> (21) . 

<220><221> misc_feature<222> (27) . 
<400> 32 

gatctctagc actgctgarg gngargcnca r 



. (18) <223> n 
.(28)<223> n 



degeneracy 
degeneracy 



. (22) <223> n 
. (28) <223> n 



degeneracy 
degeneracy 



27 



31 



55 



60 



<210> 33 

<211> 28 

<212> DNA 

<213> synthetic 

<400> 33 
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gatctctagc actgctcarg aytaygay 

<210> 34 

<211> 31 

<212> DNA 

<213> synthetic 

<22 0><221> misc__f eature<222> 

<220><221> misc feature<222> 



(19) . . (20) <223> n 
(25) . . (26) <223> n 



degeneracy 
degeneracy 



<400> 34 

cttaaggtag gactacytgn gcytcnccyt c 

<210> 35 
<211> 30 
<212> DNA 
<213> synthetic 
<400> 35 

ttaaggtagg actacytcrt aytcytgrct 

<210> 36 
<211> 30 
<212> DNA 
<213> synthetic 

<220><221> misc_feature<222> (27) . . (28) <223> n = degeneracy 
<400> 36 

ttaaggtagg actacytcrt aytcytgnga 

<210> 37 
<211> 30 
<212> DNA 
<213> synthetic 

<22 0><221> misc_f eature<222> 

<220><221> misc_f eature<222> 
<400> 37 



;24) . . (25) <223> n = degeneracy 
[27) . . (28) <223> n = degeneracy 
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ttaaggtagg actacrttrw artcngtnc 

<210> 38 
<211> 16 
<212> DNA 
<213> synthetic 
<400> 38 
gatctctagc actgct 

<210> 39 
<211> 17 
<212> DNA 
<213> synthetic 
<400> 39 

ccttaaggta ggactac 

<210> 40 
<211> 27 
<212> DNA 
<213> synthetic 
<400> 40 

tatcgcagcg atccacttct cctctgc 

<210> 41 
<211> 27 
<212> DNA 
<213> synthetic 
<400> 41 

cttaaacggc aacctgagaa ggcttgg 

<210> 42 

<211> 28 

<212> DNA 

<213> synthetic 



<400> 42 

ctatctcctt ctgcttgaaa acaggagg 

<210> 43 
<211> 27 
<212> DNA 
<213> synthetic 
<400> 43 

acaagggaac agctcgtcga tatcgcg 

<210> 44 
<211> 32 
<212> DNA 
<213> synthetic 
<400> 44 

taacgaattc ggtgcagaca taggcgaact ac 

<210> 45 
<211> 33 
<212> DNA 
<213> synthetic 
<400> 45 

cggtgtcgac tcaggaaaac cacctctcaa gcg 



<210> 


46 


<211> 


37 


<212> 


DNA 


<213> 


synthetic 


<400> 


46 



cacaggaaac agaccatggg tgcagacata ggcgaac 

<210> 47 
<211> 1017 
<212> DNA 

130 





<213> Ave 








<400> 47 






5 


atgggtgcag 


acataggcga 






gggagaaaaa 


tagc tattga 


4— s~rs-i 4— -t— 4— 4~ -n /*■< 


10 


caacctgacg 
atcctgtacc 


gcactccttt 
gcgugrcaaa 


il-ll. >->v y Wm 4— 4— 

gaaggauucu 
car.ga ucy ay 




ggtgagcctc 


ctgttttcaa 


gcagaaggag 


15 


gcagaggaga 


agtggaucgc 


tycy dUciy c±y 




caggcagcgg 


cgagggttga 




20 


atgggagttc 
gcgaagggcg 


catgggttca 

J— 4— +-4—4— 

atgtagattt 


ggcgccgagu 
uaccggcccg 




ccaaagcttg 


caagaaatct 


cgcgau tacu 




tacgttgagg 


tcaaaccaga 


gataatagac 




acaagggaac 


agctcgtcga 


tatcgcgttg 


"si 
3Q ! 


aagggcgttg 


gggtcaagaa 


ggccuacaag 


gctctcaagg 


ccttaaaggt 


agagcaggag 




aacccgcctg 


ttacgaacaa 


ctacagcctc 


3^ 


atcgagttcc 


tgtgtgaaga 


gcacgacttt 


; A;:. 
:! 

■! ::-^r 


aagctgaaag 


caggaatgca 


agcctcgcaa 




<210> 48 








<211> 337 






45 


<212> PRT 
<213> Ave 
<400> 48 







5 0 Met Gly Ala Asp He Gly Glu Leu 

1 5 



Glu Tyr Phe Ser Gly Arg Lys He 
20 

55 

Tyr Gin Phe He Ser He He Arg 
35 40 

Asp Ser Gin Gly Arg Met Thr Ser 
60 50 55 

Val Ser Asn Met He Glu Val Gly 



agagaagaag 


ttgaacttga 


gtacttctcc 


60 


actctttacc 


aqttcatatc 


tatcataagg 


120 


caggg t agaa 


tgacctcaca 


cctctccggc 


180 


gttggaatga 


gacccatttt 


cgttttcgat 


240 


atagaggaac 


qaaaggaaag 


aagagctgaa 


300 


agaggagaga 


agtacgcaaa 


gaagtacgct 


360 


qtcqaqtcqt 


caaagaagct 


gcttgagtat 


420 


a a acrcr a cr a crcr 


cacaggctgc 


atacatggca 


480 


c acrcr a 1 1 a c cr 


actcgcttct 


cttcqgcagc 


540 


crcr a a acr a crcr a 


acrctqcccqcr 


aaagaatgtt 


600 


1 - t aaacocrca 


acctgagaag 


qcttqqaata 


660 


ctcgtgggaa 


cggactacaa 


cgaaggagtg 


/Z\J 


tacataaaaa 


cctacggaga 


tgttttcaaa 


780 


aacatagagg 


agataagaaa 


cttcttcctg 


840 


cacttcggaa 


agccagacga 


tgagaagatt 


900 


agcaaggata 


gggtagagaa 


ggccgttgag 


960 


tcaacgcttg 


agaggtggtt 


ttcctga 


1017 



Leu Glu Arg Glu Glu Val Glu Leu 
10 15 

Ala He Asp Ala Phe Asn Thr Leu 
25 30 

Gin Pro Asp Gly Thr Pro Leu Lys 

45 

His Leu Ser Gly He Leu Tyr Arg 
60 

Met Arg Pro He Phe Val Phe Asp 
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65 70 75 80 

Gly Glu Pro Pro Val Phe Lys Gin Lys Glu He Glu Glu Arg Lys Glu 

85 90 95 

Arg Arg Ala Glu Ala Glu Glu Lys Trp He Ala Ala He Glu Arg Gly 
100 105 HO 

Glu Lys Tyr Ala Lys Lys Tyr Ala Gin Ala Ala Ala Arg Val Asp Glu 
115 120 125 

Tyr He Val Glu Ser Ser Lys Lys Leu Leu Glu Tyr Met Gly Val Pro 
130 135 140 

Trp Val Gin Ala Pro Ser Glu Gly Glu Ala Gin Ala Ala Tyr Met Ala 
145 150 155 160 

Ala Lys Gly Asp Val Asp Phe Thr Gly Ser Gin Asp Tyr Asp Ser Leu 

165 170 175 

Leu Phe Gly Ser Pro Lys Leu Ala Arg Asn Leu Ala He Thr Gly Lys 
180 185 190 

Arg Lys Leu Pro Gly Lys Asn Val Tyr Val Glu Val Lys Pro Glu He 
195 200 205 

He Asp Leu Asn Gly Asn Leu Arg Arg Leu Gly He Thr Arg Glu Gin 
210 215 220 

Leu Val Asp He Ala Leu Leu Val Gly Thr Asp Tyr Asn Glu Gly Val 
225 230 235 240 

Lys Gly Val Gly Val Lys Lys Ala Tyr Lys Tyr He Lys Thr Tyr Gly 

245 250 255 

Asp Val Phe Lys Ala Leu Lys Ala Leu Lys Val Glu Gin Glu Asn He 
260 265 270 

Glu Glu He Arg Asn Phe Phe Leu Asn Pro Pro Val Thr Asn Asn Tyr 
275 280 285 

Ser Leu His Phe Gly Lys Pro Asp Asp Glu Lys He He Glu Phe Leu 
290 295 300 

Cys Glu Glu His Asp Phe Ser Lys Asp Arg Val Glu Lys Ala Val Glu 
305 310 315 320 

Lys Leu Lys Ala Gly Met Gin Ala Ser Gin Ser Thr Leu Glu Arg Trp 

325 330 335 

Phe Ser 



<210> 49 

<211> 53 

<212> DNA 

<213> synthetic 

<400> 49 
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cccgtctcgc tggtgaaaag aaaaaccacc ctggcgccca atacgcaaac cgc 



<210> 50 

<211> 26 

<212> DNA 

<213> synthetic 

<220><221> misc_feature<222> 

<22 0><221> misc_f eature<222> 

<22 0><221> misc_f eature<222> 

<400> 50 

tntnccagag cctaatttgc cagtna 

<210> 51 

<211> 26 

<212> DNA 

<213> synthetic 

<220><221> misc_f eature<222> 

<22 0><221> misc_f eature<222> 

<22 0><221> misc_f eature<222> 

<400> 51 

tntnccagag cctaatttgc cagtna 

<210> 52 

<211> 24 

<212> DNA 

<213> synthetic 

<22 0><221> misc_f eature<222> 

<400> 52 

ttccagagcc taatttgcca gtna 

<210> 53 
<211> 24 
<212> DNA 



(22) . . (23) <223> misc. feature 

(1) . . (2) <223> misc. feature 

(2) . . (3) <223> misc. feature 



(1) . . (2) <223> misc. feature 

(2) . . (3) <223> Misc. Feature 
(22) . . (23) <223> Misc. Feature 



;22) . . (23) <223> misc. feature 
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<213> synthetic 

<220><221> misc_feature<222> (22) . . (23) <223> misc. feature 

5 <400> 53 

ttccagagcc taatttgcca gtna 

10 <210> 54 

<211> 25 

<212> DNA 

15 

<213> synthetic 

<400> 54 

2 0 cttaccaacg ctaacgagcg tcttg 

<210> 55 

2^, <211> 14 

k§ <212> DNA 

<213> synthetic 

3 0."" 

y <400> 55 

[y gctcccgcag acac 

^ <210> 56 
F|| <211> 15 
40:::? <212> DNA 

<213> synthetic 

<220><221> misc_feature<222> (1) . . (2 ) <223 > misc. feature 
<400> 56 
tntacgccac cagct 

50 

<210> 57 
<211> 12 
55 <212> DNA 

<213> synthetic 
<400> 57 

60 

cgctgtctcg ct 



45 
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<210> 58 

<211> 19 

<212> DNA 

<213> synthetic 

<400> 58 

gctcaaggca ctcttgccc 

<210> 59 

<211> 63 

<212> DNA 

<213> synthetic 

<400> 59 

atgactgaat ataaacttgt ggtagttgga gctggtggcg taggcaagag tgccttgacg 
ata 

<210> 60 

<211> 45 

<212> DNA 

<213> synthetic 

<400> 60 

ttttttttta attaggctct ggaaagacgc tcgtgaaacg agcgt 

<210> 61 

<211> 14 

<212> DNA 

<213> synthetic 

<400> 61 
cttcggagtt tggg 

<210> 62 

<211> 16 

<212> DNA 

<213> synthetic 
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10 



15 



20 



25;,** 



3 0, ; 



35" 



4QS 



45 



50 



55 



60 



<220><221> misc_feature<222> (1) . . (2 ) <223> misc. feature 
<400> 62 
ancttcggag tttggg 

<210> 63 
<211> 16 
<212> DNA 
<213> synthetic 

<220><221> misc_feature<222> (1) . . (2) <223> misc. feature 

<400> 63 
cncttcggag tttggg 

<210> 64 
<211> 16 
<212> DNA 
<213> synthetic 

<220><221> misc_feature<222> (1) . . (2) <223> misc. feature 
<400> 64 
gncttcggag tttggg 

<210> 65 
<211> 16 
<212> DNA 
<213> synthetic 
<22 0><221> misc_f eature<222> 
<400> 65 
tncttcggag tttggg 

<210> 66 
<211> 25 
<212> DNA 
<213> synthetic 
<400> 66 

gggttgtgga gtgagtgttc aagta 



(1) . . (2) <223> misc. feature 
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<210> 67 
<211> 27 
<212> DNA 
<213> synthetic 
<400> 67 

ccatcctaat acgactcact atagggc 

<210> 68 
<211> 21 
<212> DNA 
<213> synthetic 
<400> 68 

ctcatacagt tacttgtctt c 

<210> 69 

<211> 489 

<212> RNA 

<213> Homo sapiens 

<400> 69 

gaacucacua uagggcucga gcggccgccc gggcaggucc gccaccaaaa ugcagauuuu 
cgugaaaacc cuuacgggga agaccaucac ccucgagguu gaacccucgg auacgauaga 
aaauguaaag gccaagaucc aggauaagga aggaauuccu ccugaucagc agagacugau 
cuuugcuggc aagcagcugg aagauggacg uacuuugucu gacuacaaua uucaaaagga 
gucuacucuu caucuugugu ugagacuucg ugguggugcu aagaaaagga agaagaaguc 
uuacaccacu cccaagaaga auaagcacaa gagaaagaag guuaagcugg cuguccugaa 
auauuauaag guggaugaga auggcaaaau uagucgccuu cgucgagagu gcccuucuga 
ugaauguggu gcuggggugu uuauggcaag ucacuuugac agacauuauu guggcaaaug 
uugucugac 

<210> 70 
<211> 52 
<212> DNA 
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3 0'- 



50 



55 



<210> 73 

<211> 16 

<212> DNA 

60 <213> synthetic 

<400> 73 



52 



52 



<213> synthetic 
<400> 70 

5 ggaatacgac tcactatagg gaaagtctct gccgcccttc tgtgcctgct gc 

<210> 71 
10 <211> 52 

<212> DNA 
<213> synthetic 

15 

<400> 71 

ggaatacgac tcactatagg gaaagtctct gccgcccttc tgtgcctgct gc 

20 

<210> 72 
<211> 647 

25 <212> RNA 

<213> synthetic 
<400> 72 

gggaaagucu cugccgcccu ucugugccug cugcucauag cagccaccuu cauuccccaa 
M gggcucgcuc agccagaugc aaucaaugcc ccagucaccu gcuguuauaa cuucaccaau 
3^3 aggaagaucu cagugcagag gcucgcgagc uauagaagaa ucaccagcag caaguguccc 
Q aaagaagcug ugaucuucaa gaccauugug gccaaggaga ucugugcuga ccccaagcag 24 0 
aaguggguuc aggauuccau ggaccaccug gacaagcaaa cccaaacucc gaagacuuga 3 00 
acacucacuc cacaacccaa gaaucugcag cuaacuuauu uuccccuagc uuuccccaga 
i*f cacccuguuu uauuuuauua uaaugaauuu uguuuguuga ugugaaacau uaugccuuaa 
45 guaauguuaa uucuuauuua aguuauugau guuuuaaguu uaucuuucau gguacuagug 

uuuuuuagau acagagacuu ggggaaauug cuuuuccucu ugaaccacag uucuaccccu 
gggauguuuu gagggucuuu gcaagaauca uuaauacaaa gaauuuuuuu uaacauucca 
augcauugcu aaaauauuau uguggaaaug aauauuuugu aacuauu 



60 
120 
180 



360 
420 
480 
540 
600 
647 
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ttcttcggag tttggg 
<210> 74 

5 

<211> 26 
<212> DNA 
10 <213> synthetic 

<400> 74 

ccgtcacgcc tccttcggag tttggg 

15 

<210> 75 
<211> 24 

20 

<212> DNA 
<213> synthetic 
25 <400> 75 

aacccaaact ccgaaggagg cgtg 



3 &4 <210> 76 

i : i <211> 29 

W <212> DNA 

<213> synthetic 

<400> 76 

4(p gcgcagtgag aatgaggagg cgtgacggt 

^ <210> 77 
45 <211> 17 

<212> DNA 
<213> synthetic 

50 

<220><221> misc_feature<222> (1) . . (2) <223> misc. feature 
<400> 77 
55 cnctcattct cagtgcg 

<210> 78 
60 <211> 30 

<212> DNA 



<213> synthetic 
<400> 78 

aacgaggcgc acctttacat tttctatcgt 

<210> 79 
<211> 24 
<212> DNA 
<213> synthetic 
<400> 79 

ccttccttat cctggatctt ggca 

<210> 80 
<211> 24 
<212> DNA 
<213> synthetic 
<400> 80 

acgatagaaa atgtaaaggt gcgc 

<210> 81 
<211> 29 
<212> DNA 
<213> synthetic 
<400> 81 

cggaagaagc aagtggtgcg cctcgttaa 

<210> 82 
<211> 16 
<212> DNA 
<213> synthetic 

<220><221> misc_feature<222> (1) . . (2) <223> misc. feature 
<400> 82 
gncacttgct tcctcc 

<210> 83 

140 



10 



15 



; 0 



<211> 14 
<212> DNA 
<213> synthetic 
<400> 83 

gctcccgcag acac 14 



<210> 84 

<211> 30 

<212> DNA 

<213> synthetic 

20 <400> 84 

caaagaaaag ctgcgtgatg atgaaatcgc 30 

25 <210> 85 

o 

H-5 <211> 50 

... : <212> DNA 

<213> synthetic 



<400> 85 



35- 5 ^ gaaggtgtct gcgggagccg atttcatcat cacgcagctt ttctttgagg 50 

b 

m 

a 
m 
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